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^ (57) Abstract: A method is disclosed for requesting genomics services from a service provider over the Internet and providing 
^ genomics .services to a client over the Internet. The client provides biological samples and identifies genome sequences of intercst. 
_ The Sei^ice provider obtains the biological samples and genome sequences, provides microarrays containing the identified genome 

sequences, and applies the biological samples lo the microarrays. The client receives the results of the analysis over the IniemeL 
Q The genomics services provided include genoiyping, gene expression, and proieomics. The Qient is able over the Internet lo obtain 

the current status of the samples and the experiments and to modify the requested experiments based, for example, on the analysis 
^ of previously obtained results. 
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Process For Requesting Biological Experiments and 
FOR THE Delivery op Experimental Information 

Field of the Invention 

5 

This invention relates to processes for requesting experiments involving macromolecular 
recognition events and for obtaining and analyzing the information obtained firom such experiments. 
More particularly, this invention relates to a process for requesting genomics information over the 
Internet, tracking and possibly modifying that request, and obtaining the results over the Internet for 
10 analysis. 

Glossary of Biological Terms 

As the following words and phrases relate to this invention, they are to be understood as 
follows: 

15 By ^^experiment" is meant a biological or chemical e}q>eriment involving at least one specific 

macromolecular recognition event An experiment is defined by a particular assay and at least one 

sample on which the assay is to be performed. 

By "macromolecular recognition evenf * is meant chemical interaction (colvalent or 

noncovalent bond) of a macromolecule or a mixture comprising at least one macromolecxile such 
20 as DNA, RNA, polypeptide, or protein on one side and another organic or inorganic molecule on the 

other side. 

iSy "capture probe," **probe moleculc,"or "probe** is meant a specific molecule involved in 
at least one macromolecular recognition event or a mixture comprising at least one molecule that is 
involved in at least one macromolecular recognition event. Examples of probes include PGR 

25 primers, oligomer probes immobilized on array surfaces or to other probes, and flourescently labeled 
antibodies. Probes are meant to include nucleic acids, DNA, RNA, receptors, ligands, antibodies, 
anti-antibodies, antigens, proteins, and also small chemical compounds such as drugs, haptens, or 
peptides. The informational content of macromolecular probes may be defined by their sequences, 
e.g., a sequence of nucleic acids for DNA probes or a sequence of amino acids for peptide probes. 

30 By "experimental protocol** is meant a description of steps taken in the course of performing 

a particular experiment An experimental protocol may be very specific, describing the detailed 
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quantities of reagents used and the detailed steps, or it may be defined generally as. for example, 
"perform PGR" or "perform DNA hybridization." An experimental protocol does not include the 
specification of the informational content of probes involved in macromolecular recognition events. 
For example, while a particular protocol for performing PGR may specify the approximate lengths 
ofDNAprimersorthenumberofdistinctprimers, it does not specify primer sequences. Similarly, 
a protocol for DNA Jiybridization may specify the approximate length of oligomerprobes used, but 

it does not specify their sequence. 

By "assay" is meant a set of instructions for carrying out an experiment on any particular 
sample, including the experimental protocol and the specification of probes used in the context of 
the protocol. An assay does not include the specification of samples on which the experiment is to 
be performed. 

By'tarset molecules or target analyte-ismeant the moleculesofinterestinasubstance that 

are to be interrogated by binding to the capture probes immobilized in an array. 

By "genotyping" is meant the process of analyzing the genetic makeq) or genetic 
constitution of the studied organism. For instance, genotyping is used to locate the presence of 
specific base changes within a given individual's DNA sequence. Genotyping may be used, for 
example, to gather information regarding an individual's predisposition to a particular health 
condition or to confirm a diagnosis of genetic disease. 

The term "phenotype" refers to the observable properties of an organism resulting fiom an 
interaction between the organism's genotype and the environment 

By 'TCR" is meant atechnique for synthesizing large quantities of specific DNA segments 
consisting of a series of repetitive cycles, including denaturation of the double-sti^ded DNA 
molecule, annealing of oUgonucleotide DNA primers, and DNA polymerization. 

By 'TCRprimer" is meant an oUgonucleotide used to bind a complementary single-stianded 
DNA molecule and prime synthesis of flje DNA complementary strand by DNA polymerase. 

The term "array" refers to a spatial grouping or an airangement. 

The term "probe array" refers to the array of different biosites deposited on a reaction 
substrate that serve to interrogate mixhires of target molecules or multiple sites on a single target 
molecule administered to the surface of the array. 

By "array fabrication" is meant the process of preparing a substrate surface and depositing 
the appropriate probe to that surfeceinaparticular spatial grouping or arrangement through th^ 
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Of many different mechanisms. e.g.. ink-jet deposition, capillary, and photolithography. The term 
"oligonucleotide probe arrays" refers to the probe arrays wherein the probes are constructed of 
nucleic acid. 

By "throughput is meant the number of analyses completed in a given unit of time. 
By "SNP" is meant Single Nucleotide Polymorphism. 

By "mRNA target molecule" or "mRNA target analyte" is meant a substance containing 
identical mRNA components or a mixture of disparate mRNAs. 

By "gene expression" is meant a multi-step process, inchiding regulation of that process, by 
which thepioductofagene is synthesized. Moreimticularly.theDNA sequence ofageneisu^ 
totranscribeacomplementaiyRNAsequence.andtheRNAsequenceisthentranslatedintopro^^^ 
which is responsible for carrying out nomial ceUular fimction. The study of gene expression 
generally refers to the analysis of RNA levels within a particular biological sample. SpedficaUy. 
KNA is an interaiediary species in the process of gene expression. As such, the relative level of 
RNA generally provides a proper indicator of the level of gene expression. Studying the level of 
gene expression provides a valuable method to analyze the effects of various compounds upon a 
biological syst^. 

By "gene expression stody" is meant an analysis of the process by which the product of a 
gene is synthesized. Specifically, the study detects the presence of an RNA mol««ile synthesized 
from the geneofinterest. The analysis canbeachievedbyoneoflhemanydifferentmethodsfa^^ 

to those skilled in the art 

By "genotyping study" is meant an analysis of the genetic makeup of an organism. 
Specifically.thestudy detects genetic variation. Theanalysis includes, withoutl^^^ 
ofSingleNucleotidePolymorphisms.DNA8equencedeletions.DNA sequence translocations.DNA 

sequence rearrangements, and DNA sequence repeats. 

By ♦•pharmacogenetics" is meant the study of the biochemical and physiologic aspects of 
dnigeffects.includingwithoutlimitation.absorption.distribution. metabolism.^ 
and mechanisms of drug action in relation to the genetic makeup of the organism. 

By "genetic epidemiology" is meant the study the relationship between genetics and disease 

in populations. 

By"allele" is meant alternative formsofagene. Thus, an aUelecan represent one of several 
possible mutational forms of a gene, which is positioned at the same location on a chromosome. 
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The term "allele scores" refers to data defining the particular alleles detected upon analysis. 
An allele score may represent, without limitation, the number of specific alleles of interest detected 
within a single organism, or the number of organisms within a given study that have the specific 

allele or alleles of interest 

The terai "expression scores" refers to data defining the expression levels of a gene, or set 

of genes. 

The temi "dot-scoring" refers to themeasurementofasignal that comes ftom the interaction 

of a probe with target molecules. 

The term "bio-scoring" refers to the biologically meaningful quantities and any other 
biologically meaningful information extracted from dot scores. Examples of bioscores include 
expression scores and allele scores. 

The term "hybridization" is meant to include, without limitation, a means of two or more 
components to interact through hybridization, an association, linking, coupling, chemical bonding, 
covalent association, lock and key association, or reaction. For the purposes of this invention, tiiese 
terms are used interchangeably. 

By "material kitting" is meant the process of putting togetiier flie necessary materials 
required to conduct a particular step in a study, including without limitation, array fabrication and 
CR/Labeling. 

By "bar code" is meant any recognizable identifier tiiat corresponds to infoimation that may 
be digitaUy stored and referenced. For example, one example of a bar code is the representation of 
characters by sets of parallel bars of varying thickness and separation tiiat may be read optically by 
transverse scanning. 

Background 

[.imitations gf Traditiona l Kinlomcal Research 

Traditional biological research is plagued by numerous technical as well as logistical 
problems. For example, genotyping and gene expression requires a large capital investtnent as well 
as a high level of labor and technical . expertise. Another problem involves a researcher's inabiHty 
to master relevant protocols in biological research. In particular, while a researcher's strengtti 
primarily centers about an ability to envision new relevant types of experiments, most of a 
researcher'sproductivetimeistypicallyspentraasteringcoraplexexperimentalprotocols./^^ 
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techniques such as microarray fabrication for gene expression analysis. SNP detection, and 
proteomics typically require significant up-firont costs, expensive reagents, and lengthy and costly 
training. These problems are exacerbated by an increasing rate of innovation that causes existing 

protocols to become obsolete at an ever increasing pace. 

Asecondpn>blem is the researcher'sinability to procure the specificnumberoftest samples 

necessaiy to perform a statistically meaningfiil population study. In the field of human study, this 
particularproblemisexacerbatedby ethical and privacyconcems and related institutional pio^ 

which generally restrict researchers to a limited number of unadulterated human samples. As 
compared to academic research, commercial researchers may incur even greater restrictions upon 
their ability to test a statistically significant number of unadulterated samples. As a result of 
problemsassociatedwiUisamplecoUection,mostbiologicalreseatchisperfoimedusingmvi/^ 

and tissue cultures. In vitro samples, however, are well known to include large numbers of genetic 
variations or mutations. Experimental data obtained with these samples are firequentiy not 
statistically relevant to ttie population as whole, or even a sub-population. 

A third problem is a researcher's inabiUty to analyze the large data sets produced by the 
researcher'sexperiments-Althougharesearcheristypicdlyquiteadeptatpladngexperimen 

in the context of existing knowledge, the amount of data at the researcher's disposal may be 
overwhebning. This problem is exacerbated by two recent trends: (I) high-throughput microanay 
experiments that provide vast information about DNA polymorphisms, gene expression, and 
proteins, and (2) the grov^rth of publicly accessible data sets, including tiie exponential growth of 
DNA sequence, SNP polymorphism, and gene expression information. 

It is tiierefore very desirable to provide a researcher performing basic and applied research 
a process for requesting tiiat specific assays, including specific probes, be designed and made 
available to the researcher for carrying out experiments. In addition, it is desired fliat aresearcher 
be provided tools to access, search, and request specific samples fi»m a biological repository {Le.. 
DNArepository)anditsassociatedrepository database, including relevantmedical documentation. 

Providing the researcher with fest and simple access to experimental services, a biological 
repository database, and analytical tools wUl lead to a rapid accumulation of genetic knowledge. 
This would greatly benefit the development of personalized tiierapeutics, the future of medicine. 
Such a process would enable researchers to study tiiousands of samples and analyze the fimctional 
relevance of individual genetic variations through Unkage to clinical outcomes. 
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Genomics 

Genomics refers to the study of genes and their function. Recent advances in genomics aie 
bringingaboutarevolutioninourunderstandingofthemolecularmechanismsofdisease.incta^^^^ 

the complex-interplay of genetic and environmental factors. Genomics is also stimulating the 

discoveiyofbreakthroughhealthcanspmductsbyieveahngthousandsofnew 

thedevelopmentofdn»gs.andbygivingscientistsimi6vativewaystodesignnewdnip^ 

and DNA diagnostics. Genomics based therapeutics include traditional small chemical drugs. 

protein drugs, and gene therapy. 

Genetic information provides the basis for understanding biological and physiological 
functions in organisms. This infonnation is encoded in deoxyribonucleic add (DNA), which is the 
basis of the inherited characteristics in organisms. Each DNA molecule is comprised of a 
combinationoffournucleotide bases, adenine, guanine, cytosine, and thymine. wMcharegen^^ 

referred to by the letters A. G. C. and T. respectively. Each nucleotide base is an^ged along one 
of two complementary strands inaDNAmolecule.eachApairingwithTandeachGpair^^ 
C witheachsuchpairbeingcalledabasepair. The order ofthe bases along a DNA strand is called 
the DNA sequence, and the entire DNA content of an organism is caUed its genoipe. Witiiin the 
genome aredefmedDNAsequencesthatfomiparticularunitsofheredity.Theseunitsaregeaera^^ 
referred to as genes. Differences ofone or more bases in a gene, referred to as genetic variation, can 

•modify how a gene functions by altering tiie protein it encodes, which alters the protein's normal 
effect on cell fiinction.Additionally.oneor more basechanges can occuroutsideofageneyet still 

effect cellfimctionbyaltKingtiiatgene'snomiallevelof gene expression. Basechangestiiateff^^ 

geneexpi^ionmay be found intiiegeneitself.orinregulatoryelementsoutsideoftheg«^^^ 

complicated genetic variations may involvebase changes inafirst gene, wherebythat base 

ultimately effects the expression of a critical second gene. Genetic variations arc a primary 
component of all major diseases, including cancer, diabetes, and cardiovascular disease. 

The most common types of genetic variation between individuals are Single Nucleotide 
Polymorphisms, which are referred to as SNPs. A SNP is a change in a single base pair along the 
DNA sequence, such as replacing a T with a G along one strand and its complementary A with a C 
along theother strand. Itis estimated tiiatthereareStolOmillionSNPsinherited from eachpan^^ 

Tens of thousands of SNPs must be measured in an individual to distinguish those SNPs tiiat are 
medicallyrelevantfromthosetiiataieofnomedicalconsequence.Tobettercomprehendti»eg«^^ 
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basis of disease and its treatment, the effect on biological fiinction caused by individual genetic 
variation and the interplay between multiple genetic variations must be understood. In this regard, 
major efforts are currently underway to discover the genetic SNP variability that exists in the 
population and in turn to use this basic SNP data as a starting point to understand inherited 
personaUzed variation in disease risk and in therapeutic response. 

Hie traditional drug discovery process is expensive, time consuming, risky, and labor 
intensive. Despite expendittires of billions of dollars, only a small fraction of the thousands of 
compounds screened by pharmaceutical companies are developed into commercial drugs. To 
develop new medical treatments and diagnostics based on genetic information, pharmaceutical 
companies require access to information establishing the relevant functional connections between 
genes, disease states, medical history, and pharmaceutical treatments. 

There are six stages in the drug development process, and genomics plays an important role 
in each stage. The first stage is target identification, in which a determination is made whether a 
particular gene is a good target for further investigation. This stage is typically initiated with 
genomic shidies involving DN A sequencing, genetic mapping, and RNA analysis, where RNA. or 
ribonucleic acid, is die nucleic acid that serves as an intermediary for translating DNA sequence 

information into proteins. 

The second stage in the drug development process is target validation, in which an attempt 
is made to demonstrate whether an identified target has an effect on the course of a disease. Target 
validation usesanumber of genomic techniques. includingRNA analysis, proteina^^^^^ 
biology. It is important in this stage tfiat candidate targets be investigated under many different 
experimental conditions to better understand and select optimal drug targets. 

In the third stage, primary screening, drug leads are identified through large-scale testing of 
collections, or libraries, of compounds against validated targets. The goal of this stage is to find 
"hits,"i.e, to find those individual membersoftiie compound Ubrarythatbindto,inWWt.oractiva^ 

aparticular target. High throughput genotyping (tiie study of DNA variation) and gene expression 
(the analysis of gene activity) are needed to enable researchers to screen hundreds of genes. 

The fourth stage is tearfcpftwfemion. in which drugleadstiiat are likely t^ 
therapeutic properties are identified by conducting successive rounds of chemical alterations and 
biological tests. Similar to target vaUdation, a number of techniques are used in the lead 
optimization stage, including protein analysis, cell biology, chemical syntiiesis. and other high 
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throughput experiments. This stage may also involve the genotyping and gene expression of 
compounds for therapeutic activity in animal models of disease. 

TTie fifth stage in the drug development process is preclinical development, in which 
compounds a« tested in cell cultures and in ammals to evaluate their safety, effectiveness. 
distributionthroughoutthebody.andmetaboli2ation.Formulationtestsdeter^ 
method as well as check for consistent quality in manufecturing. Expression analysis provides 
toxicity assessments by indicating the behavior of specific toxicity marker genes. 

The sixth stage in the process is clinical development, in which the safety and efficacy of 
drug leads or pharmaceutical compounds are tested in humans. Because clinical trials are the most 
expensive stage of drug development, phannaceutical companies need to be able to analyze the 
genetic makeup of each patient to detemiine or predict responsiveness or adverse reactions to 
particulardrugs.Genotypingexperimentsmaybeused to identify each individual'slikelyrespo^^ 

to a drug, thus maximizing the productivity of the trial and reducing overall costs. 

Genomics is the key to reducing the risk, cost, and lead time of drug discovery and the 
development of personalized medicine. Pharmaceutical and biotechnology companies are actively 
incorporating genomics into each stage of the dmg development process. Thus, there is a need for 
rapid, large scale, and systematic searches for panels of SNPs. coupled with lifestyle and clinical 
dataforeachpersonproviding asample. Suchmethods will facifitate the discovery of the predictive 
niles that relate individual patient data to disease risk and patient response to phamiaceutical 
therapy. The complexity of the data required, the process of risk factor discovery, and ultimately 
the abiUty to deUver these discoveries into cUnical trials suggest a need for high throughput genetic 
and molecular analysis, high capacity bioinformatics using advanced statistical methodologies, 
automated high throughput sample handling, and large scale sample libraries. 

QirrentmethodsforanalyzinggenomicdataandshidyingSWsaretypicallylowti^^ 

technologies that produce results with limited accmacy due to human error and smaU sample sizes. 
These procedures typicaUy generate single data points per sample per experiment, such as the 
expression levels of a single gene or the identification of a single polymorphism. A significant 
amountofworkis involved inpreparingasample.processingit. and generatingtiiedataforeventual 

analysis. TTie resulting data is not easily reproduced due to cost, time, and sample availability. 
Since the biological samples are often of limited quantity and number, researchers must be carefiil 
to use samples sparingly if more data may be needed for fotaure experiments. 
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Current technologies are also limited by sample storage and tracking. Traditional sample 
storage methods involve storage of liquid materials or frozen tissue specimens and require costly, 
space-consuming, low temperature cryogenic facilities. Scalability of these storage methods is 
difficult and requires larger faciUties and additional fteezers. Further, sample tracking and retrieval 
are problematic since employing a folly robo& system under cryogenic conditions is costly and 
often impractical. 

Thus, there is a need for more productive and more efficient genetic analysis methods that 
are not plagued by the problems associated with sample quality and quantity. To more rapidly 
enhance our genetic understanding, there is a need to provide research entities located in remote 
locations with rapidaccess to biological samplearchives and sample testingservice8.Further,thTO 

is a need for more productive and efficient drug discovery methods and genomic information 
services that willallowpharmaceuticaiandbiotechnologycompaniesto develop significantiyhigher 

numbers of drug compounds each year. 



fUfprtrnmc Data Transfer 

Certain embodiments of the present invention are directed to use over the Internet Hie 
Internet consists of a large number of computers connected via network links and communicating 
via standardized Intemetprotocols to fonnaglobal, distributed network. While the term "T^ 
as used herein is intended to refer to what is commonly known today as the Internet, it is also 
intended to encompass private as weU as pubUc networks and any variations to those networics that 
may exist in the fixhire, including changes and additions to existing standard protocols. 

Computerusers communicate andexchangeinfonnationoverthelntemetusingstandardized 

protocols. The World Wide Web ("WWW^ provides a visual interface to feciUtate this 
communication and exchange of information. The WWW aUows a server computer, having set up 
a web site, to send text and graphical images in the form of web pages to a client computer for 
display or storage. 

Web pages are typically written using Hyper-Text Markup Language CUTML*'). The 
software on the client computer that interprets and executes the commands contained in web pages 
is called a browser. In this context, the "client" denotes the computer using a browser to request and 
display itrfonnationandthe"server''denote8thecompttterresponding to thoserequestsbyprovidin^ 
web pages. A single web page may contain data from a number of different servers. Examples of 
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browsers include the Microsoft Internet Explorer and the Netscape Navigator. The client indicates 
to the browser a desire to view a particular web page by entering that web page' s address, which is 
referred to as its Uniform Resource Locator CmL'*). into the browser. The browser then initiates 
a client computer request to a server asking that it transfer to the client the HTML file that defines 
the requested web page. When the requested web page is received by the cUent, the browser uses 
it to construct a visual image of the web page on the cUent's display monitor or to store a visual 
image of the wdi page on the client's computer. The web page contains various commands for 
displaying text, graphics, controls, background colors, and other display features. In addition, ttie 
web page may contain other URL addresses, called hot links, that point to other web pages at flie 
server-swebsiteorotherwebsites. In addition, web pages may contain form fields or other devices 
that permit the client to transmit data to the server computer and may contain audio or video objects 
as well. A web page may be larger than the image displayed on the client computer's monitor, in 
which case the browser suppUes scroll bars that permit the cUent to view different portions of the 
wdb page. 

Web page description languages other than HTML are either currenUy available or planned 
forfuture release. In addition, various extensions to the basic HTML standardhavebcOT 
to provideadditionalfeatures.Theseextensiotis include WebBotcomponents,Javaapplets,browse^ 
plug-ins. Dynamic HTML ("DHTML"), and ActiveX controls. These extensions. aU well known 
in the art. permit web pages to ofer capabilities and services far beyond that offered by HTML 
alone. Methods for using these and other extensions to design web pages are well known in the art. 

Abrowser communicates with a web server over a transmission link that operates according 
to the Transmission Control Protocol / Memet Protocol ('TCP/IP"). For the majority of Internet 
communications, a browser communicating with a server over a TCP/IP link sends and receives 
infomiation using the Hyper-Text Transfer Protocol ("HTTFO. Most web browsers also enable 
clients to access server resources and services using additional protocols such as File Transfer 
Protocol ("FTP") and Tetoet- 

Communication between a client and server generally takes place over communication links 
such as telephone lines and public network lines that are not inherently secure. Two important 
security issues related to client-server communications are privacy and authentication. Privacy 
involves prohibiting anyone other than the intended recipient from being able to read a 
communication between the client and the server. Privacy is typicaUy accomplished using 
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cm,»gr,pMcme*odsby»hichcommumcaUo,«aree„cryp.edprior.ot«mn»»^^^ 

receipt 0.« popular pro»col fcr pmvidtag an encrypted conunumcaticn Irnlc 
b«we«r d.e server »d U» cU«.. is d,. Secure Soclcets Uyer fSSn pro»co. developed by 
N«scap.Co«».uni.«io»C»n..msprotteoli,WicaU,r..^.oasa.eHTTPSproW.O^ 

security protocols ». av^hble, including ftival. Commumcstions Technology CPCn. Secure 
Hyper-Tex.Tr^ponP».«c.lC™%'««'^'V<3«^'riv«yCTOPn.Me.hodsfor«si.g 

these and other protocols to provide s«n.re Intend co™«ctions « «ell kno,™ m th. «t 

Asecondin,po,tants=curityissoerelatedtoelient.serve.conm«nncationsisauthent«^^ 

Anthentication involves veri^g d«. *c entity with whom a cUen. (or server) is conrnnnucatu* 
isinfi«t«»ac.ualserver(orcli«™.).0„eme.hodofau,hen.icationusesce,,i«c,.esto««^ 

a message. A certificate is a set of digital data th« identifies an entity and verifies that the pubhc 
enctyption ^ signature Iceys induded within .he certificate belong to .ha. entity. Me*ods of 
ptovidingauthenticaBonareweUkiiomintheart. 

O,,ren.ly.theprima.ysumda,dp,otocoufbrallowingapplicaaons» 

document are HTTP «.d HTIPS, «.d the Web pages are encoded using HTML. Howev«. the 
,emB "Web" and -World Wide Web" as used herein are intBrfed to encompass fitture m«tav 
tag„ages«.dtr««por.pro.ocolstha. ^y be used in place ofor in addition U>HMI,Hni>. or 
HTTPS Further. ^ wm "Wd," as used herein is a broad tenn including, for example. Ite 
hardware and software server compo«=n.s as well as any non-standard or specialized compon.«B 
that interact with fte server componentt to provide services .0 Web site users. 

TV detaited description of ttie pres«it invention given below is p.««nted largely m terms 
ofp,ocedures,steps.logicdUgran«.»«lothersyniboBcanddcscripdv.represe.«ationstha.d^^^^ 

U,eope,a.ionsofda.aprocessingdevicesconnectedvi.ne««»k.. These process descriptions and 
representations a^themetbodsusedby those experieoc«l«sldlWin.he.rttomostefr»:dvdy 

cOoveyftes«bs»nceoftheirworlctoofl.ersskill.dinfl.ea,t Cettaio steps r«pire the physicd 
manip»lafionofeIectricalsignalstha.arecapableofbeings«,red,..ansferred.combinrf^ 
di,pUyed.oro.l«rwisemanip«la..di«.computersys.emornetwortForconvenie„ceandcto^ 
,hesedectticalsign.Um.ybe«fe.Tedtoa.bit,.vataes.elemenB.symbols. operations, messages. 
,.nns,mnnbers.images.orthelika AU.fa.ese and similar terms are merely convenient labels and 
are to be associ«ed wifl. the appropriue electrical signaU to which they correspond. Unless 
specificaUy stated or ottwwise apparent torn die foUowing description, the terms "processmg," 
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"computing," "displaying." and the like refer to actions and processes of a computing system and 
networkthatmanipulatesandtnmsformsdatarepresentedbyelectrical signals withm 

device'smemory.wheretheterm^'memor/'includesROM.RAM.magneticstora^ 
storage media, and the like. 

SUMMARY OP THE INVENTION 

The present invention is directed toward the advancement of genomic studies by providing 
genomic services to academic and commercial researchers, drug manufacturers, and healthcare 
providers over the Internet The present invention provides methods and systems for requesting 
experimental services torn a service provider and providing experiments servi^^ 
experimental requestconsistsofanassayandaset of samples on whichtheassay is tobeperfom^^ 

Inoneembodiment.theassaymayincludeamultiplicityofprobes.Someoftheprobesmay 

be immobilized or otherwise identifiable while others, such as PGR primers, may be used in 
solution, m one embodiment, some of the probes used in the assay may be umnobilized on a flat 
surface, such as glass. In another embodiment, some ofthe probes may be immobilized in a three- 
dimensional polymer. In yet another embodiment, some of the probes may be immobilized on bead 
surfaces. In one embodiment, the assay may be specified exactly by the cUent. either by t^^ 

providmg a specific set of probes to be used in the assay or by the client selecting a pre-specified 
assay Inanotherembodiment,alistofpre.specifiedassaysmaybe stored inthe database 

accessible to cUents through relational or keyword based queries or through a hierarchically 

organized catalog. 

In another embodiment, instead of providing a specific assay, the client requests that an 
assay, including a set of particular probes, be designed based on a chent-specified target of the 
experiment. One type of experimental target is a list of genes the client may be interested m 
examining for changes in expression levels, in which case a particular assay, including a set of 
oligonucleotide hybridization probes for use on microarrays. is designed prior to conducting the 
experiment Another typeofexperimentaltargetisaUstofSNP polymorphisms the cUent may be 

interested in examming for variations aciossapatientpopuhition. in which caseaparticulara^^^^ 
i«:ludingasetofoUgonucleotidehybridizationprobes.andasetofPCRprimersforamplify^^^ 
number of polynucleotides containing polymorphic sites is designed prior to conducting the 
experiment In yet another embodiment, the experimental target is a list of proteins, in which case 
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an assay such as a multiplex ELISA assay, including a set of antibodies, is designed prior to 

conducting the experiment. 

m client may provide one or moie biological samples. The service provider then obtains 

the biological samples and genome sequences and carries out an assay by applying at least one of 
the biological samples U> at least one of the microarrays-lliecUent receives dataiep^ 

the results of the appUcation of the biological samples to themicioarrays. The biological samples 
mayincludetissuesamplesorblood samples. Inanother embodiment. theserviceprovider 

arepositoryofbiological samples andprovidesacatalogofthatrepository.Th^ 

the catalog and selects biological samples from the catalog. 

Communication between the client and the service provider may occur over an Internet 
connection. whichmaybeasecureconnection.Inafurther embodiment, thedatarq^ 
the appUcation ofthe biological samples to the microarrays may first be analyzed by the clie^ 

another embodiment the analysis is first performed by the service provider. 

The step of providing biological samples may include transporting the samples fi:om the 
clientto the sendee provider or selectingbiological samples firomarepositoryundert^^ 

the service provider. The data may include gene expression data, and the biological samples may 
includetotalIQ^Asamplesorpoly-ARNAsamples.Inanaltemateembodiment.thecUentp™^^ 

one or more biological samples. The service provider obtams the samples and appUes at least one 
ofthebiological samples to the microarrays.ThecHent receives datarepresentative^ 

Step- 

The present invention overcomes the limitations of current methods of providing genomic 
services byprovidingaWghly integrated on-line technology platform that combines ina^^^^^ 

embodiment assay services, a biological repository databa«. vdth high throughput microarray 
analysis, automated sample handling, high capacity bioinfomiatics. and statistical genetics. The 
disclosed platform provides on-line cost effective genotyping. gene expression analysis, and 
proteomics, thereby assisting pharmaceutical and biotechnology companies in their efforts to 
develop safer, more effective therapeutics. While a preferred use of the disclosed platform is to 
develop more effective therapeutics, the scope of the present invention is not to be so limited. 
Specifically.thepresentinventioncanbeusedbyanyclientdesiringtoobtainexperimentalser^^^^ 
By usingthepresentinvention.acUenthas the ability to design anexperimentfromaremote 

location, submit that experiment for processing, and then obtain the results of that experiment 
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delivered to the remote location. Communication to and from the remote location may occur over 
a standard Internet comicction. The experiment is conducted according to an integrated process that 
may inchide integrated purchasing processes for acquiring the necessary materials from outside 
vendors as well as laboiatory sample and status tracking routines. The sample and array tracking 
may be linked intelligently to material requisition software and lab infoimation management 
software. The samples and arrays are identified for tracking, preferably digitaUy indexed^ 
a client to remotely query the status of the subject assay. Further, the present invention includes 
capabiUties permitting a client to remotely modify and redesign previously submitted experiments 
based, for example, ontheresultsofotherorpast experiments. Once theexperiments are perfom^^ 

the researcher may be provided tools to access the experimental data and analyze it in the context 
of biological information accessible through the computer. Once the analysis stage is completed, 
die researcher may be enabled to design and request new experiments that are based on the results 
of the analysis of previously completed experiments. 

Brief Description of the Drawings 

FIG. 1 illustrates a connection between a Client and a Service Provider over the Intemet 
HG. 2 illustrates an embodiment of the present invention directed toward providing 
genotyping services. 

FIG. 3 illustrates an embodiment of the present invention directed toward providing gene 
expression sovices. 

HG. 4 illustrates an embodiment of the present invention directed toward providing 

proteomic services. 

no. 5 illustrates a microarray suitable for use with the present invention. 

FIG. 6 ilhistrates a user interface for analysis software suitable for use with the present 

invention. 

HG. 7 illustrates a division of tasks between a CUent and Service Provider in one 

embodimmt of tiie presaat invwition. 

FIG. 8 iUustrates a data entry screen for entering a genotyping project according to one 

embodiment of the present invention. 

no. 9 illustrates a data viewing screen according to one embodiment of the present 

invention. 
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FIG. 10 illustrates the flow of data according to one embodiment of the present invention. 
HG. 1 1 illustrates the steps involved in a typical genotyping experiment according to one 

embodiment of the present invention. 

FIG. 12 iUustratesaProductionMenuaccordingtooneembodimentofthepresentinvention. 

FIG. 13 iUustrates a Source Plate Importation Screen according to one embodiment of the 

presoit inventioiL 

• FIG. 14 illustrates a Work Order Window according to one embodiment of the present 
invention. 

FIG. 15 illustrates a Plate Detail Window according to one embodiment of the present 
invention. 

FIG. 16 iUustrates a Sample Detail Window according to one embodiment of the present 
invention. 

FIG. 17 illustrates a BioScore Computation Window according to one embodiment of the 
present invention. 

HG. 18 illustrates a Process Flow Chart in accordance with an embodiment of the present 
invention. 

Detailed DESCRIPTION 
Description of Experimental Sorvices 
Thepresent mventioncanbedivided into five aspects. (1) Sample Acquisition; (2) Sequence 
or SNP Identification; (3) Array Fabrication; (4) Data Acquisition; (5) DataDeUvery and Analysis. 
These aspects wiU be explained via their use with respect to four particular embodiments of the 
present invention: (1) G«iotyping based on Client suppUed samples; (2) Genotyping based on 
samples selected from a DNA repository database; (3) Gene Expression based on CU«>t suppUed 

samples; and, (4) Proteomics. 

Referring to HG. 1, Client 105 uses Browser 110 to request Web Pages 100 fiom Service 

Provider 120. This request is made over Internet 1 15 and is processed by Web Server 125. Wd) 
Server 125 may be comiected to DNA Repository Database 210. Web Server 125 is further 
connected to Microartay Process 130 and configured to obtain results fiom that process for 
transmission to CUent 105 as Web Pages 100. The basic fiamework of FIG. 1 is suitable for use 
with each of the fi)llowing four embodiments of the present invention. 
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qpnntvpine: Client Suppl ied Samples 

RefeiTing now to the flow chart of FIG. 2, one embodiment of the present invention occurs 
when the CUent provides its own samples in response to the query in Step 200 of FIG. 2. The CUent 
next chooses the sequences or SNPs of interest in Step 215. (No specific ordering of the steps is 
requii«d unless specificaUy stated. For example. Step 215 could occur before Step 200.) The 
sequences or SNPs that the CUent desires to be analyzed may be submitted to the Service Provider 
via an on-line infoimation system or viasome other means such as an electronic spreadsheet or hard 
copy. The samples and the identified SNPs are submitted in Step 220 to the Service Provider as a 
work order. All interaction between the Client and the Service Provider may be transacted over die 
Internet, preferably usingasecure encrypted protocolsuch as the HTTPS protocol described above. 

The submitted samples are preferably identified by a unique identification label or barcode. 

After sampleshave been acquired and the sequences or SNPsofinteresthavebeenidentified, 

the Service Provider provides suitable capture probes in Step 225. In Step 230. the Service Provider 
designs, fabricates, and validates custommicioarraysandPCRprimers.ThePCRprocessampUfies 

the DNA in Step 235. 

In Step 240. the Service Provider investigates or interrogates the samples using the prepared 
microarray8.ThemicToarraymaybefabricatedusingthecapillarymicroarraymanufacturi^ 
disclosed in U.S. PatentNo. 6,083.763, which disclosure is incorporated herein by reference. Such 
system is capable of printing a microarray at high throughput. Other microarray types or 
hybridization or expression detection systems may be used, as the present invention is not limited 
to the use of any particular platform. A microarray consists of an array of test sites formed on a 
suitable stmcture. Oneexampleofamicroarray isshowninFIG. 5. DNA or RNA capture probes 
of known binding characteristics are attached to each of the test sites. The probes in each test site 
differ from the probes m other test sites in a known manner. A sample containing an uiiknown 
candidate molecule, such as a DNA molecule containing a SNP. is hybridized to the microarray 
containing the capture probe. A signal is applied to the microarray so that certain electrical, 
mechanical, or optical projections of the test sites can be detected. A positive signal indicates that 
a molecular structure present in the sample substance has bound to a certain probe in one or more 
of the test sites of the array. The microarrays may be imaged using the microarray imager also 
disclosed in U.S. Patent No. 6,083,763, which can image an entire plate of 96 microarrays within 
60 seconds. Further, this system may be automated to minimize human enor and permit rapid 
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analysis. Customers may choose to use either standard or custom microarrays. Standard arrays are 
pre-fabricate5d and contain SNPs of general interest, such as, for example, those related to drug 
metabolism or cancer predisposition. 

After the data is obtained, it is deUvered in Step 245 to the Client for analysis in Step 250. 
The delivery in Step245 preferably occurs usingasecurc on-line connectionbetweentheQienta^ 

Service Provider, but it may also occur by alternate methods. In addition, the analysis of the data 
in Step 250preferably occurs subsequent to delivery to the CUent. but itmayoccurprior to deUveiy 
aswell. TheanalysissoftwaremayberesidentintheCustomer'scomputerortheServiceProvider 
may provide on-line analysis that is controlled remotely by the Client. If the analysis is provided 
by the Service Provider, then the CUent may receive none or only a portion of the actual data on 
which the analysis is based. If the data is analyzed by the Client, the analysis may be performed 
using data analysis software provided by the Service Provider or by standard commercial packages 
such as electronic spreadsheet software. The arwdysis software preferably provides large data set 
analysis and visualization functions. A sample user interface for analysis software suitable for use 
with the present invention is diovm in FIG. 6. 

It should also be understood, however, that the presently disclosed on-line technology 
platform is not limited to genotyping services that test only for SNPs. More particularly, a CUent 
may request that genetic variations other than or in addition to SNPsbe tested. For instance, an on- 

line Qient can specifythat the submitted samples, such as cancer specimens, be tested formulti^^^ 

base DNA deletions and insertions using microarray analysis. Alternatively, a CUent can request 
tf»tsubmittedsamplesbeanalyzedforSNPs.deletions.in5ertions,and/orrearrangeme^^^ 
formsof genetic variation and methods of detecting them ate weU known in the art, and the presM^ 

invention is not limited to any particular genetic or biological test 

n^nntvpintr: S i^^i pl^-o fj^lecte d Prnm DNA ReoosUorv Database 

In a second embodunent of the present invention shown in HG. 2, the Client does not 
provide its own samples in Step 200. but instead selects samples from DNA Repository Database 
210 in na 2. Tliough the present embodiment discloses selection of DNA samples from aDNA 
Repository via access to a DNA Repository Database, the present invention is not so Umited. For 
example, biological samples from a repository containing a wide-range of biological samples could 
be used. Such samples could include, without limitation, tissue, blood, skin, and hair. 
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The DNA Repository services are preferably provided directly by the Service Provider, but 
they may be provided directly by a third party or indirectly by the Service Provider as an offeite 
Sample Archive Service. DNA Repository Database 210 preferably includes biological sample 
informatioii as well as an index to attendant clinical records associated with those samples. These 
recordspreferably arebar-coded and devoid of information that could lead to patient identification. 

In one embodiment, theDNAsamples in the DNA Repository are stored on cards according 
to themethod described inProvisionalU-S.PatentApplication60/161.694 filed onlO/26/99. which 

appUcation is being incorporated by reference. When a particular sample is requested, automated 
equipment such as robotics may be used to select and obtain the corresponding DNA sample and 
provide the sample to the microartay for interrogation. However, other methods for DNA sample 
storage and access may be used in accord with the present invention. 

The Client preferably uses a secure Internet connection to search through the sample records 
inDNARepositoryDatabase210. The ServiceProviderprefcrablyprovides access to thed^ 
viasecure WebPages. as described above. More specificdly.thepreferredembodiment permits tt^^ 

Client access to the database, even though tiiat Client is remotely located relative to the DNA 

Repository itself. 

In addition to a catalog of indexed biological samples, the repository database preferably 
includes corresponding clinical records, phenotypic information, and follow-on medical history. 
Following a search or analysis of this and other repository information, the Client selects a subset 
of samples for investigation. Once the Client selects tiie samples, the Client then deagns the 
genotyping study on-line by selecting tiie SNPs of interest in Step 215. The analysis request is 
provided to theServiceProviderasaworkorderinStep 220. The remaining steps in FIG.2proceed 

as in the previous embodiment 

In an alternate embodiment of the present invention, the Service Provider performs 
genotyping analysis oftiie samples in theDNARepositorypriortoarequestforthat analysis by tiie 

Client A Client may later access tiie results of tiiat analysis, preferably using a secure Internet 

connection. 
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CrPne Expres. ^i^n: Client Supplied Samples 

In yet another embodiment, gene expression services may be requested and provided over 
a secure Internet comiection between the Client and Service Provider in a manner similar to that 
previously described for genotyping services. 

Referring to FIG. 3, the CUent submits samples to the Service Provider in Step 300. The 
samples preferably comprise tissue culture cells, total RNAsamples, and/or poly-ARNA samples. 
The Client identifies the SNPs of interest in Step 305 and submits that information to the Service 
Provider along with the samples asawork order inStep310. The submitted samples are prefc^^^ 

identified by a unique identification label or barcode. 

The Service Provider processes the received samples in Step 315 and generates the 
approprialecaptureprobesinStep320.TTieServicePrDviderfabricatessuitable^^^ 

325. In another embodiment, the CUent chooses the microarrays ftom a collection of standard 
prefebricatedmicroarray designs madeavailj*lebytheServiceProvider.TheServic^ 

performs the hybridization in Step 330 to interrogate the samples. 

After the data is obtained, it is delivered in Step 335 to the CUent for analysis in Step 340. 
ThedeliveryinStep335preferablyoccursusingasecureon.UneconnectionbetweenUieaientand 

Service Provider, but it may also occur by alternate methods. In addition, the mAysis of the data 
in Step 340 preferably occurs snbsequenttodeUvery to the Client, but itmayoccurpriorto delivery 
asweU. TheanalysissoftwaremayberesidentmtheCustomer'scomputerortheServiceProvider 
may provide on-line analysis that is controUed remotely by the Client If the analysis is provided 
by the Service Provider, then the CUent may receive none or only a portion of the actual data on 
which the analysis is based. If the data is analyzed by the CUent, the analysis may be performed 
using data analysis softwareprovidedbythe Service Provider or by standard commercial p«^^ 

such as electronic spreadsheet software. The analysis software preferably provides toge data set 
analysis and visualization functions. 



The study of proteins and their fimction is called proteomics. In yet another embodiment, 
proteomic services can be requested and provided over a secure Internet comiection between the 
CUent and ServiceProviderinamamiersimilartothatpreviouslydescribedfor gene expressions^ 

genotyping services. 
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10 



A protein is a biological molecule consisting of a high molecular weight polypeptide chain 
of L-amino acids that is synthesized by living cells. Proteins are required for the structure, fiinction, 
and iBgulation of cells, tissues, and organs in the body. The amino acid sequence of the polypeptide 
chain determines the proper structure and function of the protein. Any variation in the DNA gene 
sequence can cause the wrong amino acid to be synthesized into the polypeptide chain. An amino 
acid change can cause disruption of protein stracture and function, and lead to a disease state. 
Additionally, aberrant gene expression can alter nonmal protein function. For instance, the level of 
protan within a cell can increase when the DNA gene sequence is over-expressed. This ovct- 
expression may be caused by a number of different factors, including drug response and gene 
mutation, bicreases in protein levels can cause severe disruption of ttie increased protein, as well 
as of the function of other proteins. As a result, the over-expressed protein can cause a disease state 
or further exacerbate a disease condition. 

A preferred method for detecting a specific protein ("antigen") is to use an enzyme-linked 
immunosorbent assay ("EUSA"), which is weU known in tiie art An EUSA is useful for 
1 5 determining the presence or amount of antigra in samples ranging from crude cellular lysates to 
highly purified protein antigen preparations. An ELISA is able to detect native proteins or proteins 
with altered conformations. It should be understood by one skilled in the art, however, that various 
other methods of detecting antigens in a biological sample can be used without deviating from the 
scope of the present invention. For instance, ELISA protocols may diffisr significantly depending 
20 upon tiie specific antigen detected. It should be fimher understood that tiie term "antigen" includes 
proteins, drug molecules, viral particles, antibodies, and the Uke. 

Referring to FIG. 4, the Client submits antigen samples to the Service Provida: in Step 400. 
The samples preferably comprise solid tissue, tissue cultiire cells, crude or purified protein lysates, 
and the like. The Client specifies the antigen of interest in Step 405 and submits that information 
25 to the Service Provider along with the samples as a work order in Step 410. 

If necessary, the Service Provider processes the received samples in Step 415. Methods to 
prqwre antigen samples suitable for testing are well known in the art. For example, a purified 
antigen can be prepared using sodium dodecyl sulfiite polyacrylamide gel electrophoresis (SDS- 
PAGE). SDS-PAGE provides a highly purified antigen sample suitable for sensitive detection and 
30 analysis protocols. Where native antigens are desired, a prefened electrophoretic purification 
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method is a nondenaturing or "native" gel. Other suitable mettiods, however, are also well known 
in the art. 

In Step 420, the Service Provider selects a capture probe operable to detect the antigen of 
interest designated by the Client in Step 405. It is well known in the art that the selected capture 
probe depends upon the antigen of interest. For instance, antibodies are generally used to detect a 
widerangeof antigens, and vice versa. Antibodies are also used to detect other antibodies. Other 
combinations of probe based systems for detecting an antigen are well known in the biological arts. 
Using the selected capture probe, the Service Provider fabricates suitable microarrays in Step 425. 
One microarray for ELISA detection can be fabricated according to the description in "High- 
Throughput Microarray-Based Enzyme-Linked Immunosorbent Assay (ELISA)," Mendoza et aL 
BioTechnigues, 27:778-788 (October 1999), which is incorporated by reference herein. In another 
embodiment, the Client chooses the microarrays from a collection of standard prefabricated ELISA 
or other microarrays made available by the Service Provider. The Service Provider performs the 
antigen detection protocol (t.e., ELISA) in Step 430 to interrogate the samples. The bound antigoi 
may be detected according to the Mendoza article. However, other protein detection systems may 
be used to practice the present invention. 

After the data is obtained, it is delivered in Step 435 to the Client for analysis in Step 440. 
The delivery in Step 435 preferably occurs using a secure on-line connection between the Client and 
Service Provide, but it may also occur by alternate methods. In addition, the analysis of the data in 
Step 440 preferably occurs subsequent to delivery to the Client, but it may occur prior to delivery 
as well. The analysis software may be resident in the Customer's computer or the Service Provider 
may provide on-line analysis that is controlled remotely by the Client. If the analysis is provided 
by the Service Provider, then the Client may receive none or only a portion of the actual data on 
which the analysis is based. If the data is analyzed by the Climt, the analysis may be performed 
using data analysis software provided by the Service Provider or by standard commercial packages 
such as electronic spreadsheet software. The analysis software preferably provides large data set 
analysis and visualization functions. 

Description of System Software 
This section describes an embodiment of Systmi Software suitable for use with the present 
invention. The term "System Software** includes all of the software used by the Clxeat to 
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communicate with the Service Provider and obtain and analyze the data provided by the Service 
Provider including if applicable a browser suitable for the receipt of web pages and generic software 
such as an electronic spreadsheet program that may be used to analyze or visualize the received data. 
It also includes the software used by the Service Provider to provide genomics services to the Client 

5 and access to a repository database. The following description may be used by one of skill in the 
art to write System Software in accordance with the present invention. The software may be written 
using a variety of software platforms, but will typically include some HTML code and its various 
extensions for creating web pages supplied to the Client. Although the present invention is herein 
described in terms of various embodiments, it is not intended that the invention be limited to these 

1 0 embodiments. Modification within the spirit of the invention will be apparent to those skilled in the 
art 

Sample Submission 

Software may be provided to permit secure, encrypted online access to genotype and gene 

15 expression information and analysis. The software may additionally provide access to operational 
processing activities and may facilitate the submission of samples and the analysis of results. The 
detailed description of the software system given below is presented largely in terms of procedures, 
steps, logic diagrams, and other symbolic and descriptive representations that depict the operations 
of data processing devices connected via networks. These process descriptions and representations 

20 are the methods used by those experienced or skilled in the art to most effectively convey the 
substance of their work to others skilled in the art. 

Preferably samples are entered into the software system and assigned distinct barcodes prior 
to delivery to the Service Provider. In this way, the samples may be tracked throughout the various 
processes. The System Software will preferably be divided into integrated modules to aid high- 

25 throughput data collection and analysis. One module, for example, may be provided for SNP 
genotyping,phannacogenics, and genetic epidemiology. Another module may be provided for gene 
expression at the mRNA level. Both modules preferably include analytical algorithms, querying 
capabilities, and visualization tools specifically designed for analyzing high-throughput genomics 
data. An export fiinction may be provided to allow a Client to transfer data to its own database. 

30 The software system facilitates communication between the Client and the Service Provider. 

The Client submits samples, creates studies, initiates projects, views raw data, analyzes data, and 
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completes various studies. The Service Provider receives project requests from customers, issues 
work orders, logs completed work orders, and completes projects. Those receiving the work orders 
locate samples and perform operations on the samples. These roles are illustrated in FIG. 7. 
Although this embodim^t has described various steps carried out by the Client and by the Service 
Provider, it would be possible for certain of the steps assigned to the Client to be carried out instead 
by the Service Provider and vice versa. In addition, certain steps could be carried out by a third 
party. 

The following description applies to one embodiment of the system software that could be 
used in accordance with the present invention. This description is presented by way of example and 
is not intended to preclude other embodiments that could occur to one of skill in the art. Not all of 
the following steps will be required in every application. 

The first step in the present embodiment of tfie system software is to obtain login 
information. The Client enters a User Name and a Password to obtain access to the system. Once 
access is granted, the Client is presented with an interfiice environment as, for example, is provided 
by the Microsoft Windows operating system. Access to the system may be granted on a number of 
levels depending upon flie User Name. The contents of top-level and expanded menus may vary 
depending on the functional access permissions granted to a particular user. The level of permission 
may, for example, be based on the user's role in the process. A Client, for example, may be 
permitted access to the Sample Submission menu, the Study Management mrau, the Data Viewing 
menu, and the Data Analysis menu. Various employees of the Service Provider may be permitted 
access to the Sample Receiving menu, the Sample Processing menu, and the Barcode Management 
menu. 

Sample are submitted in units called ''submissions*' and each submission is assigned a unique 
submission code. Each submission may have multiple samples, each of which may be assigned a 
unique barcode. Prior to shipment, electronic submission is made via the Software System. Unique 
codes are assigned to each submission and to each sample within it for ease of tracking. Preferably, 
the barcodes are printed and afiBxed to the samples by the Client, but they may instead be printed 
and affixed by the Service Provider upon receipt. Also, preferably, each sample shipment should 
include a unique submission code. Each sample contained in the shipment should be uniquely 
identified either by a distinct client-issued identifier or, preferably, by an affixed barcode provided 
by the Service Provider. Upon receipt by the S^ice Provider, each shipment is identified by the 
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enclosed submission code. The samples in the shipment may then be matched to the samples listed 
in the electronic submission to verify that all samples were successfully received. 

The electronic submission of the sample via the System Software occurs prior to processing. 
This electronic submission preferably is done remotely by the Client prior to shipment, but may 
instead be done by the Service Provider upon receipt. Electronic sample submission preferably 
includes entry of the Client Name, the Work Station Operator Name, and the Sample Type. Data 
for each sample may be entered directty into the System Software or may be imported from an 
electronic spreadsheet such as Microsoft Rccel or a database. The data entered for each sample 
preferably includes a Sample Number, a Patient/Organism ID, a Tube Identifier, and the Number 
of Units (/.e., the number of identical samples). A barcode may be generated by the System 
Software for each submitted sample. Once the data is entered it may be submitted over a network 
to the Sarvice Provider. Further, the bar codes may be printed by the Client and ^lied to each 
sample. 

The software preferably provides a tracking number by which the shipping status of the 
samples may be monitored. For example, a Federal Express tracking number may be assigned by 
the System Software to each sample submission. The status of the submission may flien be viewed 
by opening the saved submission information and selecting the tracking number of interest. In this 
way the Client may determine if the samples are in transit or have been received by the Service 
Provider and entared into the sample database. 

Upon receipt of the samples, the Service Provider accesses the System Software and entas 
a Receiving Operator Code as well as the Submission Code that was assigned when the samples 
were electronically submitted. Storage and substorage information may also be entered. If the 
Client did not apply barcodes, then barcodes may be generated and printed by the System Software 
and applied by the Receiving Operator at this time. Finally, the samples are placed in the assigned 
storage and substorage facilities and the sample database is updated. 

Studies and Projects 

The System Software may be designed to implement both studies and projects, where the 
term "study" represents a unit of scientific inquiry defined by the Client and the term "projecf 
rq)res«its a unit of e:qperimental data. A project is defined by a set of samples and a multiplexed 
biological assay that is perfonned on them within the context of a particular study. For example, 
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three studies (Study 1, Study 2. and Study 3) could be based on three sample sets (Sample Set 1, 
Sample Set 2, Sample Set 3, respectively). Each study could be directed toward a subset of four 
assays (Assay 1, Assay 2, Assay 3, and Assay 4). Study 1 may include Project A directed toward 
Assay 1 and Project B directed toward Assay 2. Study 2 may include Projects C, D, and E directed 
toward Assays 2, 3, and 4, respectively. Finally.Study 3 may consist ofProjectF directed to Assay 
3. Each project or study may be in any one of the following four stages, each of which is optionally 
associated with the indicated color code: (1) Created [blue], (2) Initiated [yellow], (3) In Process 
[orange], and (4) Completed [green]. 

Study management tools may be provided to allow the Client to create studies, initiate 
projects, and monitor the progress of the various projects and studies. In addition. Customers may 
be permitted to modify the projects and studies based on, for example, earlier obtained results. 
These tools may be initiated by a Study Management Window that lists all of the Customer's 
projects, either pending or completed. The Client may indicate on this screen those projects that are 
of interest. 

To create a new study, the Client may choose the New Stady option fiom the Study 
Management Menu. A window may then appear showing the Client name and permitting the Client 
to enter a name for the study, an objective for the study, and an experimental plan for the study. The 
new study will then appear in the CUent/Study/Project Tree, which lists the projects associated with 
each study and the studies associated with tiie particular Client. 

As an example, and referring to FIG. 8, a genotyping project may be entered according to 
the following procedure: (1) Select "Genotyping" as the Project Type in Field 510. (2) Enter the 
Project Name in Field 51 5. (3) Enter the Project Objective m Field 520. (4) Select the Bio Test in 
Field 525. (5) Select the Priority in Field 530. (6) Enter the Due Date in Field 535. The Request 
Date will be entered automatically into Field 540. As the project progresses, its status will be shown 
in Field 545 and the Start Date and Complete Date will be updated in Fields 550 and 555, 
respectively. The project may be created and saved without initialization or the project may be 
initialized during the creation process. To initiate the project, the Client accesses the Sample 
Submission menu to display a list of barcoded samples in Panel 560. The samples may then be 
chosen by barcode identifier, imported into the newly created project, and tiansfcnred to Panel 565. 
At this time the project may be initiated. Initialization of a Gene Expression project proceeds in a 
similar manner except that "Expression" is selected as the Project Type in Field 510. The status of 
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a project may be viewed at any time. A study is typically considered to be completed when the 
desired scientific objective has been accomplished. 

Project data may be viewed by the Client using the System Software as indicated in FIG. 9. 
TheData Viewing Window in FIG. 9 is divided into three main sections: (1 ) the Client/Study/Proj ect 
Tree and Sample List in Section 600; (2) an Amy Image Sample Information area in Section 605 
designated for displaying detailed information about selected samples; and, (3) a Score Area in 
Section 610 for displaying allele scores for genotyping or expression scores for gene expression. 
The Client selects a Study and aPioject in Tree Area 615 of Section 600. After apioject is selected, 
the System Software retrieves the requested information firom the server (or servers) and displays 
the sample set in Sample List Area 620 of Section 600. An array image will be shown in Section 
605,andthescoreswiU be displayed in Section 610. Ifa particular sample is selected, then detailed 
sample information may be shown along with tbs score data . 

Anah/sis 

The Systran Software preferably provides analysis tools by which project data may be 
analyzed. Each table of information containing data generated within a project is analyzed 
individually, with each row in a project table corresponding to an mdividual sample. The columns 
may contain experimental data such as expression levels or allele scores, clinical or biological 
information about the samples, or sample identifiers such as the sample barcode identifier. Tools 
may be provided to analyze the data in the project table. Referring to HG. 10, such tools may 
include: (1) Interactive Query Builder 650 allowing the Client to interactively build queries for 
selecting subsets of data for analysis; (2) Analytical Spreadsheet 65 5 aUowing the CUent to manually 
select subsets of data and to apply data analysis algorithms; and, (3) Interactive Visualizer 660 
allowing the Clientto interactively visualize selected dataand the results of data analysis. Datafiom 
Project Table 665 may be transported sequentially from one tool to another or exported as Local File 

670 as shown in FIG. 10. 

Interactive Query Builder 650 preferably allows users to select rows (corresponding to 
samples) from Project Table 665 via several search techniques. For example, a fast search might 
allow flie Client to select columns of data that will be used to formulate the search criterion. These 
selected columns could then be uploaded into Analytical Spreadsheet 655. A more detailed 
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searching method could be provided in which a detailed query using Boolean operations could be 
created and stored for later reuse. 

Analytical Spreadsheet 655 allows manual selection of subsets of data from Project Table 
665. The selected subsets of data may then be used as input to analytical algorithms and 
visualization tools, or may be exported in the desired format for analysis using outside programs. 
Preferably, data may be selected for use or export by rectangular region, as well as by columns or 
by rows. Further, analytical algorithms may be generated directly from the spreadsheet by selecting 
the desired algorithm and the required input variables. For example, genotyping and outcome 
information could be analyzed using two-variable (odds ratio) and three-variable (Mantel-Haenszcl) 
methods. As another example, genotyping and outcome mfoimation could be analyzed using a 
supervised learning algoritimi that discovers predictive models (probabilistic rules) for predicting 
clinical outcomes from the genotyping information. Data from the analytical spreadsheet preferably 
may be exported into a local file using a variety of standard formats. 

Data from Analytical Spreadsheet 655 may be transferred to Interactive Visualizer 660 for 
visual display as, for example, a scatter plot or a curve plot A scatter plot may be provided for 
visualization of three selected columns: two numeric columns and a third column containing a small 
number (typically up to 16) of distinct values. Each dot in the scatter plot would then correspond 
to a row in the spreadsheet (/.e, to an individual sample). The color of the dot could then indicate 
the vahie in the third column. For example, a scatter plot could be used to visualize a gene 
expression project in which a number of toxic and non-toxic compounds are applied to a particular 
cell line. Qusteringofdots would then indicate distinct expression profiles. By selecting toxicity 
as the third column, one could then detemiine if tfie known toxicity of the admmistered compounds 
correlates with the clustering, Le., with the expression profiles. 

A curve plot may be used for simultaneous visualization of multiple curves. The horizontal 
axis would typically be chosen to represent time, dosage, or some other numeric value. Multiple 
gene expression levels or other numeric values may be selected for multi-color display along the 
vertical axis. For example, a curve plot could be used to visualize agene expression project in which 
the gene expression of a number of genes is simultaneously measured in a cell line at multiple time 
points after treatment with a particular compound. Time-dependent variation in gene expression 
levels of multiple genes may then be simultaneously visualized by displaying time on the horizontal 
axis and the measured gene expression levels for genes of interest along the vertical axis. As another 

.27- 

8/2/2005, EAST Version: 2.0.1.4 



wo 01/31333 



PCTAJSOO/29326 



example, a curve plot may be used for a gene expression project in which the gene expression of a 
number of genes is simultaneously measured in a cell line in response to treatment with different 
dosages of a particular compound. Dose-dependent variation in gene expression levels of multiple 
genes may be simultaneously visualized by displaying dosage along the horizontal axis and the 
measured gene expression levels for the genes of interest along the vertical axis. 

Production 

As described above, a project is a unit of experimental data collection, defined by a set of 
samples and a multiplex assay that is performed on the set of samples. Samples used in a project 
initially may be present in individual tubes or arrayed in microtiter plates. If the samples are not 
arrayed, the firet step in a project involves the plating (arraying) of samples and the normalization 
of sample concentration. The subsequentprocessing of samples may then be tiacked on a plale-by- 
plate basis. Plates (containing samples at different processing stages) may be imported ftom one 
project into the otiier. Samples may be tracked on a plate-by-plate basis even during imaging, dot- 
scoring, allele-scoring, and expression-scoring steps — well after the physical plates have ceased 
to exist in the process of converting physical objects into information. 

Each project typically consists of a number of processing steps (typically about 7) depending 
on the type of project (genotyping, gene expression, or proteomics). A typical genotyping project 
is shown in FIG. 1 1. A Production Manager will typicaUy create woric orders for each individual 
step and give the work orders to Workstation Operators. A Workstation Operator receiving a work 
order will typically enter the work order barcode into the workstation, enter the plate barcode, and 
initiate the process. The workstation communicates over a network witii tiie database, receives 
details regarding the operation, performs the operation, and reports the successful completion. 

Upon completion of an oi^eration, the Production Manager will log the completion of the 
work orders. A work order may contain multiple sample plates. Each such plate may be in one of 
three states: In Process. Completed, or Failed. Each state is preferably represented by a different 
color code. 

The System Software may be used to issue a work order via a Production Menu as shown 
in FIG. 12. Source Project Window 700 displays initiated projects that are not yet completed. 
Detailed information about each project may be displayed in Project Information Window 705 and 
Work Order Step Window 710. When applicable, individual samples may be listed in Project Input 
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Window 715. For example, a project may contain completed plates in its first three steps: DNA 
plating and normalization, sample PGR, and target preparation. Since the DNA plating and 
normalization step is part of the project, the project was initiated with individual samples, which 
would he shown in Project Input Window 715. Iftheprojectwere initiated witiiPCRplates instead, 

5 thai DNA column 720 and PGR cohimn 725 in Work Order Step Window 710 may be shaded. 
Since Target Plate Column 730 is present in Work Order Step Window 710, flie work order for 
hybridization is ready to be issued. 

The Hybridization Step begins with the importation of source plates into a woric order, as 
shown in FIG. 1 3, which displays a dialog box suitable for use with processing stq>s where physical 

10 plates are imported and created in the course of operation (eg., PGR and target preparation). 
Different dialog boxes may be used for steps that do not use imported plates {eg., plating and 
normalization) and steps where the output plate is not a physical plate containing samples but plate- 
related infonnation (e.g.. hybridization, imaging, dot scoring, and bio scoring). After source 
plates are imported, a Woric Orda Window witii plate information may be presented as shown in 

1 5 FIG. 14. This window may be used to select a Woric Station Operator, to assign a due date, and to 
assign a priority. It also may be used to print barcodes or to print the work ordar itself. A 
completion date may be entered to log the completion of a work order. 

The Syston Software may be used to view the status of work orders, plates, and samples 
using the Production Menu of FIG. 12. Selecting aparticularplateinFIG. 12 displays a Plate Detail 

20 Window as shown in FIG. 15. Selecting a particular sample in HG. 12 displays a Sample Detail 
Window as shown in FIG. 16. 

In the bio-scoring step, the DotScores are converted into biologically meaningfiil quantities 
(BioScores). A BioScore in a goiotyping project is a genotype value (L e., an allele scoreX whereas 
in a gene expression project it indicates a gene expression level. A BioScore computation may be 

25 initiated using the dialog in FIG. 1 7. A project is completed when a bio-scoring plate is completed. 
At that point, the data firom the project is ready for viewing and analysis. 

Description of the Process Flow 
Referring to FIG. 18, a study or project is initiated when Client 105 uses Systran Software 
30 805 over Intemet 1 15 and submits Array Design Client Order 810. Schedulmg and Wotk Order 
Control 820 generates Array/Assay Design Woric Order 825 that results in Document Genoation 
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830 and Design Approval 835. Array/Assay Design Work Order 840 is then transferred to 
Purchasing/Receiving 840, which handles Material Check-In 845, Receiving and Inspection 850, 
and Oligo Storage 855. Client 105 again uses System Software 805 over Internet 1 1 5 and submits 
Sample Submission Order 860. The resulting Sample Acquisition stage 865 involves Sample 
Check-In 870, Sample Receiving and Inspection 875, and Sample Storage 880. Client 105 uses 
System Software 805 over Internet 1 1 5 and submits Sample Processing Order 885. Scheduling and 
Work Ord^- Control 820 generates Sample Retrieval Work Order 890 resulting in Sample Retrieval 
895, Sample Processing 900, and Sample Storage 905. Next, Surface Preparation Work Order 910 
is generated, resulting in the Material Kitting 915, Surface Preparation 920, Silanization Inspection 
925, and Release to Stock/ Scrap 930. Array Fabrication Work Order 935 results in Material Kitting 
940, Array Fabrication 945, Array Fabrication Inspection 950, and Release to Stock/Scr^ 955. 
Sample Processing Woric Order 960 results in Material Kitting 965, Labcling/PCR 970, 
Hybridization 975, Imaging 980, and Dot/Bio Scoring 985, The results from Dot/Bio Scoring 985 
are transferred to Data Analysis Work Order 990 resulting in Data QC 995 and Data Release 1000 
to System Software 805 over Internet 1 15 and back to Client 105. 

Conclusion 

Although the processes described herein have used the Internet for the submission of the 
work order, the selection of the biological samples, and the delivery of data, the present invention 
is not limited to the use of the Internet For example, the samples and the work order may be 
submitted by mail or the results may be provided in the form of a computer file on a compact disc. 
The format of the computer file may be chosen so that the Client can import the data into its own 
analysis software. For example, if the Client analyzes the data using an electronic spreadsheet 
program, then the data may be supplied over the Internet or on a compact disc in the form of an 
electronic spreadsheet 

The present invention, therefore, is well adapted to cany out the objects and obtain the ends 
and advantages mentioned above, as well as others inherent herein. All presently preferred 
embodiments of the invention have been given for the purposes of disclosure. Where in the 
foregoing description reference has been made to elements having known equivalents, then such 
equivalents are included as if they were individually set forth. Although the invention has been 
described by way of example and with reference to particular embodiments, it is not intended that 
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this invention be limited to those particular examples and embodiments. It is to be understood that 
numerous modifications and/or improvements in detail of construction may be made that will readily 
suggest themselves to those skilled in the art and that are encompassed within the spirit of the 
invention and the scope of the appoided claims. 
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Claims 

I claim: 



1 1 . A method of requesting genomics services from a service provider and providing genomics 

2 services to a client comprising: 

3 under control of said client, 

4 . providing one or more biological samples; 

5 idaitifying one or more genome sequences; 

6 under control of said service provider, 

7 obtaining said samples and said genome sequences; 

g providing one or more microanays wherein each of said microarrays contains at 

9 least one of said genome sequences; 

10 applying at least one of said biological samples to at least one of said 

\ I microarrays; and, 

1 2 under control of said client, 

13 receiving data representative of said applying step over the Internet. 

1 2- The method of Claim 1 wherein said genome sequence comprises a single nucleotide 

2 polymorphism. 

13. The method of Claim 1 wherein said step of providing one or more biological samples 

2 comprises: 

3 under control of said service provider, 

4 providing a repository of biological samples; 

5 providing a catalog of said repository; 

6 under control of said client; 

7 accessing said catalog; and, 

g selecting said biological samples, from said catalog. 

1 4. The method ofClaim 3, wherein said steps ofproviding a catalog and accessing s^^ 

2 occur over an Internet connection between said client and said service provider. 
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5. The method of Claim 4 wherein said Internet connection is a secure Internet connection. 

6. The method of Claim 1 wherein said one or more biological samples comprises tissue 
samples. 

7. The method of Claim 1 wherein said one or more biological samples con^rises DNA 
samples. 

8. The method of Claim 1 further comprising the step of analyzing said data representative of 
said applying step. 

9. The method of Claim 8 wherein said analyzing step occurs under control of said client. 

10. The method of Claun 8 wherein said analyzmg step occurs under control of said service 
provider. 

11. The method of Claim I wherein said data representative of said applying step comprises 
genotype data. 

12. The method of Claim 1 wherein said sXep of providing one or more biological samples 
comprises transporting said samples from said client to said service provider. 

13. The method of Claim 12 wherein said data representative of said applying siep comprises 
gene expression data. 

14. The method of Claim 12 wherein said one or more biological samples comprises total RNA 
samples. 

15. The method of Claim 12 wherein said one or more biological samples comprises poly-A 
RNA samples. 
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1 16. A method of requesting proteomics services from a service provider and subsequently 

2 providing proteomics services to a client comprising: 

3 under control of said client, 

4 providing one or more biological samples; 

5 identifying one or more antigens; 

6 under control of said service provider, 

7 obtaining said samples and identified antigens; 

8 providing one or more microarrays wherein each of said microarrays contains 

9 probes to detect at least one of said identified antigens; 

1 0 applying at least one of said biological samples to said microaiiays; and, 

1 1 under control of said client, 

12 receiving data representative of said applying step over the Internet 

1 17. The method of Claim 16 wherein said step of providing one or more biological samples 

2 comprises: 

3 under control of said service provider, 

4 providing a biological sample repository; 

5 providing a catalog of said biological repository; 

6 under control of said client; 

7 accessing said catalog; and, 

8 selecting said biological samples from said catalog. 

1 18. The method of Claim 17, wherein said steps ofproviding a catalog and accessing said catalog 

2 occur over an Internet connection between said client and said service provider. 

1 19 The method of Claim 1 8 wherein said Internet connection is a secure Internet connection. 

1 20. . The method of Claiml or Claim 16 further comprising applying a unique identifier to each 

2 . of said one or more biological samples. 
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I 21. The method of Claim 20 wherein said applying step is performed under control of said client. 

1 22. The method of Claim 20 wherein said applying step is performed under control of said 

2 service provider. 

1 23. The method of Claim 20 fiirthw- comprising tracking said one or more biological samples 

2 using said unique identifier. 

1 24. The method of Claim 23 wherein said tracking step is performed under control of said client 

1 25 . The method of Claim 23 wherein said tracking step is performed under control of said service 

2 provider. 

1 26. A process for remotely selecting samples from a biological repository comprising: 

2 under control ofa service provide, 

3 providing a database of said samples in said biological repository; 

4 providing a network connection to said database accessible by a client; 

5 under control of a client; 

6 accessing said database over said network; and, 

7 selecting a subset of said samples from said biological repository. 

1 27. The process of Claim 26 wherein said network is the Intemet 

1 28. The process of Claim 26 wherein said database includes clinical records corresponding to 

2 at least a portion of said samples. 

1 29. The process of Claim 26 wherein said database includes phenotype inforaiation 

2 corresponding to at least a portion of said samples. 

1 30. The process of Claim 26 wherein said database includes follow-on medical history 

2 information corresponding to at least a portion of said samples. 
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1 3 L A process for remotely conducting a genomics experiment comprising: 

2 undo: control of a service provider; 

3 providing a database of biological samples in a biological repository; 

4 . providing a networic connection to said database accessible by a client; 

5 under control of a client; 

6 accessing said database over said network; 

7 selecting a subset of said samples from said biological repository; 

8 identifying a set of genomic sequences; 

9 under control of said service provider; 

10 detOTnining if said genomic sequences are present in said samples; and, 

11 informing said client of results of said determining step. 

1 32. The process of Claim 3 1 further comprising under control of said service provider: 

2 modifying said set of genomic sequences subsequent to said uiforming step. 

1 33. The process of Claim 31 furthercomprising identifying said samples with unique identifiers. 

1 . 34. The process ofClaim 33 fiirthar comprising 

2 under control of said clirat: 

3 selecting a sample from said subset of samples; 

4 determining a unique identifier corresponding to said selected sample; 

5 requesting status information regarding said identifier; 
6 . under control of said service provider 

7 determining status of said identifier, and, 

g . informing said client of said status. 

1 35. A method ofproviding genomics services to a client comprising: 

2 receiving one or more biological samples firom said client; 

3 receiving one or more genome sequences fix>m said client; 

4 providing one or more microairays wherein each of said microarrays contains at least one 
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5 of said genome sequences; 

6 implying at least one of said biological samples to at least one of said microarrays; and, 

7 transmitting data representative of said 2«)plying step to said client over the Internet. 

1 36. A method of providing genomics services to a client comprising: 

2 receiving one or more biological samples from said client over the Internet; 

3 receiving one or more genome sequences from said client; 

4 providing one or more microarrays wherein each of said microarrays contains at least one 

5 of said genome sequences; 

6 applying at least one of said biological samples to at least one of said microarrays; and. 

7 transmitting data representative of said applying step to said client 

1 37. A method of providing genomics services to a client comprising: 

2 receiving one or more biological samples from said client; 

3 receiving one or more genome sequences from said cli«at over the Internet; 

4 providing one or more microarrays . wherein each of said microairays contains at least one 

5 of said genome sequences; 

6 applying at least one of said biological samples to at least one of said microarrays; and, 

7 transmittmg data representative of said applying step to said client. 

1 38. A method for providing experimental biological services to a client comprising: 

2 receiving a woric order from said client comprising a biological sample portion listing 

3 one or more biological samples and an assay portion listing one or more 

4 experiments based on said biological samples; 

5 performing said one or more experiments on said one or more biological samples; and, 

6 transmitting data representative of said performing step to said client. 

1 39. The method of Claim 38 wherein said biological sample portion of said woric order 

2 comprises one or more biological samples submitted by said client. 
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1 40. The method of Claim 38 wherein said biological sample portion of said work order 

2 comprises one or more pointa:s to records in a biological sample database. 

1 41. The method of Claim 38 whorein said assay portion of said work order comprises an 

2 experimental protocol and a specification of one or more probes. 

1 42. The method of Claim 41 wherein said performing stq) fiarther comprises: 

2 providing one or more microairays onto which said one or more probes is deposited; and, 

3 intorogating said one or more biological samples with said one or more microarrays. 

1 43. The method of Claim 42 wherein said one or more probes are immobilized to a substrate. 

1 44. The method ofClaim 43 wherein said one or more probes are immobilized on a flat surface. 

1 45. The method of Claim 43 wherein said one or more probes are immobilized in a three- 

2 dimensional polymer. 

I 46. ThemethodofClaim43whereinsaidoneormoreprobesareimmobilizedonbeadsurfaces. 
1 . 47. The method of Claim 42 wherein said one or more probes are in solution- 

1 48. The method of Claim 38 wherein said receiving step further comprises receiving said 

2 biological sample portion of said woric order &om a remote location. 

1 49. The method of Claim 3 8 wherein said receiving step further comprises receiving said assay 

2 portion ofsaid work order from a remote location. 

1 50. The method ofCUdm 38 wherein said transmitting step forttier comprises transmitting said 

2 data to a remote location. 
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1 51. The method of Claim 38 wherein said receiving step further comprises receiving said 

2 biological sample portion of said work order over the Internet 

1 52. The method of Claim 38 wherein said receiving step fiirther comprises receiving said assay 

2 * portionof said work order over the Internet. 

1 53. The method of Claim 38 wherein said transmitting step furdier comprises transnaitting said 

2 data over the Internet. 

1 54. The method of Claim 38 wherein said one or more biological samples comprises tissue 

2 samples. 

1 55. The method of Claim 38 wherein said one or more biological samples comprises DNA 

2 samples. 

1 56. The method of Claim 38 wherein said one or more biological samples comprises total RNA 

2 samples. 

1 57. The m^od of Claim 38 wherein said one or more biological samples comprises poly-A 

2 RNA samples. 

1 58. The method of Claim 38 wherein said assay portion of said work order comprises a list of 

2 genes. 

1 59. The method of Claim 38 wherein said assay portion of said work order comprises a list of 

2 single nucleotide polymorphisms. 

1 60. The method of Claim 38 wherein said assay portion of said work order comprises a list of 

2 . proteins. 
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